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(54) Message filtering techniques 

(57) Techniques lor reducing the amount of junk e- 
mail received by a user of an e-mail system A recipient 
desa iption containing non-address information is added 
to an e-mail message. The user has an e-mail filter which 
has access to intor mation which provides a model of the 
user. The e-mail fflter uses the norv address information 
and the model information to determine whether the e- 
mai! message should be provided to the user. The e-maJ 
filter further has access to information which provides 
models of the user's correspondents, i the filer does not 
provide the message to the user, it uses ftenon-adoVess 
information and the model information of fte user's cor- 
respondents to determine who frie message might be 
forwarded Kx A sender of e-ma* can also use the model 
information of the sender's con espondertts together with 
the non-address information to determine who the mes- 
sage should be sent to. The techniques are used in a 
system for locating expertise. 
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Description 

1 Background of the Invention 
1.1 Field of the Invention 

The invention concerns electronic messaging in general 
and electronic mai in particular. 

1 2 Description o! the Prior Art 

A major annoyance in the conventional mai system is 
junk mad. As electronic mail has grown in avaflaWrty and 
popularity, junk electronic mail has become a problem as 
well. Indeed, the ease with which an e-mail message 
may be sent to many recipients may eventualy make 
junk e-mail an even worse problem that junk conven- 
tional mail. 

The prior art has attempted to deal with the junk e- 
mail problem by means of mail filters in an e-maJ recip- 
ient's local e-mail system. Such a later sorts incoming e- 
mail for the recipient into categories determined by the 
reaper* The filter simply scans each e-mail message 
as it reaches the recipient and determines what category 
Hi should be placed in. One category is of course *dfe- 
carcf. Messages which the «ter places in that category 
a/e automatically tfscarded. Prior-art liters have had 
varying degrees of rrteiSgence; some have simply 
worked with fists of source addresses and have sorted 
accorcfing to the source of the message: others have 
used keywords provided by the recipient to sort; with oth- 
ers, finaffy. the filer observes how the recipient sorts Ns 
email for awhie and is then able to sort n a simfer fash- 
ion. For details about mail liters, see Peter W. Folk and 
Susan T. Dumais. "Personafized information delivery: an 
analysis of information f rtering methods". Communica- 
tions of the ACM. vol 35. no. 12. Dec. 1992. pp. 51*0; 
p.K. Gilford, aw. Baldwin. S T. Ber&x J_M Lucassen. 
"An architecture lor large scale information systems", m 
ProceeOngs Tenth Symposium on Operating Systems 
Principles* (Ocas Island. Wash . Dec 1985). pfx 161- 
170; E. Lute. KV. KJetSt -Retzow. and K. Hoeming, 
"MAFIA - An active mal-fifter agent tor an intelligent doc- 
ument processing support*, in Mufti- User Interfaces and 
Appfcations. a Grbbs andn A. A. VerrijrvStuari Eds. 
North HoSand. 1990. pp. 16-32; T.W Malone. KR Grant. 
FA. Turbak. S A Browst M.D. Cohen. InteBigert infor- 
mation sharing systems'. Common. ACM 30. 5 (May 
1987) 390^02. S. PoflacK *A rule-based message «ler- 
ing system". ACM Trans. Off. Inf. Syst 6. 3 (July 1988). 
232-254. P. Maes. "Agents that Reduce Work and Infor- 
mation Overload". Common. ACMZI (7) (July 1994). prx 
31-40. A problem with al such liters is that sortkig for 
another person is drfficult even tor a human being, and 
rf a titer is going to be useful, it cannot do much worse 
than a human would. 

One of the reasons lor the jur* mail is that present- 
day e-maa systems require fiat recipients be adfressed 
by e-mai addresses. In order to ensure that an e-mai 



message wit reach everyone who fright possibly be 
interested in it the sender typcaty uses a 1st of 
addresses which includes those who might be interested 
but indudes marry others as wefl. For everyon^E* those 
5 actually interested, the e^nai is of course junk mail. 

What rs needed to reduce the amour* of jurfc mad ts 
a technque wtvch permrts a sender to use sornething in 
adfftion to the e-mail address to specify the kinds of peo- 
ple who are to actually receive the emad and permits a 
to titer to use the information provided by the sender to later 
the mai so that only those kinds of people actuaBy 
receive it tt is an object of foe invention (fisclosed hereto 
to provide such a technique and foereby to reduce the 
_ amount of junk e-mafl received by a user of the e^nail 
system 

2 Summary of the Invention 

The invention reduces the amount of junk e-mail 
20 received by a user of the e-mai system by addng a recip- 
ient specifier to an e-maa message. The recipient spec- 
ifier non-address information to further specify the 
recipients in the group to whom foe message is sent who 
should actualy receive tie message The mai later for 
25 a given recipient has access to information about that 
recipient and uses that information toother with the non- 
adoVess information in the e-mai message to deter mme 
whether the message should be provkfed to the given 
recipient I the norvaddress information and toe infor- 
30 mabon about the recipient irvJcate that foe given reent- 
er* should not receive the message foe filter does not 
provide it 

In another aspect of foe invenfion, foe sender-smart 
titer does the filtering. The sendenr provides a recipient 

as specTier which uses non-adr>essirifornB^tospeciiy 
potential recipients to the mai HRer. In «s aspect, how- 
ever, the sender's mai titer has access to information 
about the possfcle recipients and uses fois information 
together with the non-address informafion to determine 

40 foepotential recipients to whom the message should be 
sent 

The first and second aspects of foe invention are 
combined in a further aspect of foe invention, namely a 
system for locating experfee in foe e^nai system tofois 

45 system, foe sender species ah area of expertise by 
means of a 1st of keywords which are relevant to foe 
area. The of keywords is included in a recipient spec- 
ifier in the message. The mai later for a potential reedi- 
er* has access to the document ties of the potential 

so recipient and to a fist of the emai messages sent and 
received by the potential recipient The mai liter uses 
foe document Res to determine the recipient's areas of 
expertise If the keywords in foe recipient specifier match 
one of the areas of expertise, the mafl Uter provides the 

ss e-mai message to the rpotentialrec^^ 

titer uses foe fist of. e-mai messages to determine cor-. 
respondents of the the potential recipient who may have 
foe area of expertise speofied in foe recipient specifier 
and forwards the message to foose correspondents The 
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maH I ater of each potential recipient which actually pro- 
vides the message tp tie recipient further sends a refer- 
ral message to the sender of the message, who thus 
knows exacity who r^cwed the message. 

Other objects and advantages of the apparatus and 5 
methods disclosed herein wi be apparent to those of 
ordinary eJqI *i the art upon perusal of the following 
Drawing and Detailed Description, wherein: 

3 Brief Description of the Drawing 

FIG. 1 is a high-level block diagram of apparatus 
embodyrig the invention; 

FIG. 2 is a diagram of user model 1 13 in a preferred 
errtxxSmert; 

FIG. 3 is a Diagram of correspondent models 111 in 
a preferred erntxxjrnent; and FIG. 4 is a diagram of 
data structures used by mail filter 109 in a preferred 
errtxxfiment 

Reference numbers in the Drawing have two parts: 
the two least -significant digits are the number of an item 
in a figure; *>e remaining digits are the number of Ihe 
figure in which »>e item first appears. Thus* an Hem with 
the reference number 201 first appears in FIG. 2. 

4 Detailed Description of a Preferred Embodiment 

The following Detaied Description begins with an over- 
view of the rverrtion and then describes indetaB how the 
invention is irnpternenf ed in apparatus to locate expertise 
in an e-mail system 

Overview of the mention: FKL 1 

FKx 1 shows a high-level overview of apparatus 101 
which embocSes toe invention. Apparatus 101 rs 
employed in a network 103 which comects a number of 
users I05(a„n) Network 103 may be a network such as 
Internet or a commercial e-mai network, or it may be an 
e-mai system which rcmrnurttcates between users of a 
single computer system Each user 105 is connected to 
network 103 by means of a Bnk 107 over which user 105 
can send and receive e-mail messages. A mai item of 
the type used in the invert 

119 is a standard e-mai message except for two addi- 
tional components: 

1. recipient speedier 121 which uses non address 
information to further descrt>e the recipients who 
should receive the e-mail; and 

2. referral fist 127. which is a Sst of potential recpi- 
ents who passed the e-mai on and of recipients to 
whom ihe e-mai was provided 

Reqpient specifier 121 has two parts, recpiert type field 
123. which generaly indcates how recpient specifier 
121 is to be interpreted, and reorient descrption 125. 



which contains the non-address information which is 
actually used to determine whether mai item 119 is to 
be provided to a given recipient 

A user 105 who wishes to reduce thS^jbrount of junk 
e-mai he receives has a maH Rter 109 as part of his e- 
mai system. When an e-mai item 119 is sent to user 
105 s address, mai titer 109 interprets recipient speci- 
fier 121 to determine whether mai ftem 119 is to be pro- 
vided to user 105(n). In interpreting recipient speedier 
109, maH liter 109 employs user model 113. which is 
data that provides a model of user iG5<n). H recipient 
Description 125 spedfies a reqpient which is of the same 
kind as that specified by user model 113. mall liter 109 
adds mail item 1 19 to titer ed mai 115 and informs user 
105(n) via interactive user mail interface 1 17 that mail 
has arrived. If user I05{n) desires, mai fitter 109 can fur- 
ther use the information in refenal 1st 127 to indicate the 
chain of referrals which resulted in ihe message being 
Directed to user 105(n). In some ernbodments. mai titer 
109 may also use the information in referral list 127 to 
send a receipt 129 which ideojjfies the e-mai message, 
the chain of referrals, and user I05(n) to the original 
sender of mai item 1 1 9. 

If user modeM 13 does not specify a recent which 
is of the same kind specified by recipient descrption 1 25. 
mai filter 109 looks to corespondent models 111 to 
determine where to send mail Hem 119. There is a cor- 
respondent model 111(m) for each of user 105(n)*s fre- 
quent corespondents, and ike user model 113. each 
correspondent model 11 1(m) contains data which maD 
titer 109 can use together with recipient o^scxpbon 125 
to determine which of user 105(n)s corespondents 
should receive mai item 119. MaH filer 109 then adds 
the names and e-mai addresses of those correspond- 
ents to referral fet 127 m mad item 119 and forwards mail 
item 119 to those correspondents. V toey in turn have 
mai titers 109. they w9 also liter mai item 1 19 as just 
desenbed In a preferred errfcorJment user 105(n) may 
specify how much control he deswes over forwarding. 
FbrwarrJng may be completely automatic, or marl filer 
109 may present user 105(n) with the information from 
recpient desapt™ 125 and a 1st of the respondents 
* has found for forwarding and let user 105(n) select 
which of the correspondents is to receive the forwarded 
letter. 

If user I05(n) wishes to send an e-mai message 
with a recipient specifier 121. user 105(n) makes that 
request of mail liter 109. Mai liter 109 uses interface 
1 17 to obtain information from user I05(n) whkh H uses 
to make recipient specifier 121. Mail Bier 109 then uses 
recpient speofier 121 w»i cwresponderrt models 111 
■n the manner described above to make a Hst of the cor- 
resporxieritswrioshnuM^ 

ing on the irrpternerfetton, mai titer 109 may simply 
send the e-mai message to those corespondents or 
permit user 105(n) to selWcwespondents from the fist 
The selected correspondents wtf of course be placed on 
referral list 127. 
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In FIG 1. mai filter 109 and correspondent mode* 
111 and user model 1 13 are ai implemented in the local 
computer system used by user I05(n). Such an imple- 
mentation is advantageous in that fce information in cor- 
respondent models 111 'and user model 113 remains 5 
under the confroi ol user 105(n). In other ernbcoVnent*. 
however, mai Bter 109 may be located at any point in 
network 103. Meed, some embodiments may contain 
only correspondent models 111. For example, a data 
base of customer information might be used as a cone- to 
spondent model 111. and mai filter 1 09 might use recip- 
ient description 125 together with the data base of 
customer information to determine which customers 
should receive e-mail about a new product or service. 

15 

A System lor Locating Expertise 

The techniques descrfced above are employed in a pre- 
fer r ed errtodrnent to make a system tor locating exper- 
tise. The Mowing discussion fust explains the utRy of so 
such a system and then presents two different embodi- 
ments. 

Using a Computer to Find Information 

2$ 

There are bascaJry two ways of f kxfcng something out by 
using a computer : 'ask a program* and "ask a person". 

The first cove** all ways of accessing i nfo r mation 
stored onftne. rcJutfng the use of fractional database 
programs: fie indexing and retrieval programs such as x> 
glimpse (by US Manber at University of Arizona) or 
Apple s Apple-Search; news filtering programs such as 
Hoover (SandPbint Corp); and even more sknpty the 
use of toote such as ftp. aw\ and text editors to retrieve 
and view fies. 35 

The second. ~ask a person*, covers ways tfiat a com- 
puter can be used as a con^nunkafion n^eolum between 
people. Currently the prime examples are etecfroric 
mail irxluolngbothpersonal e-mai and ma«ng island 
bulletin boards and newsgroups. The grooving integration 40 
of computers and telephones aBows us to also view 
telephony as a computer-based ccmrrunkation 
merJunx Simple examples of such integration are tele- 
phone address book programs that run on a personal or 
pocket axnputer and o*a! numbers for you: more sophis- 45 
ticated is fie explosion in the use of computer-based 
FAX. Today it is hard to even buy a modem that does not 
have FAX capab&ty. and by far the heaviest use of FAX 
is lor person-to-person convTunication. 

There are inherent problems with both general so 
approaches to obtaining information, ft has often been 
noted that as Ihe world of online information sources 
expands. **e*ask a program* approach suffers from the 
problem of knowing where to look For example; the 
Mosaic sysSem overcomes many of the technical prob- 55 
lems in accessing a wide variety of information on the 
Internet, by automatically handing the tow level details 
of different communicaSon protocols. It is easy and 
entertaining to browse through an enormous hyperme> 



c&a space However. fixing an answer to a spoof *; ques- 
tion using Mosaic tends to be stow and frustrating, and 
often results in taSure. One response to this problem has 
been the attempt to design systems that j^orporate 
knowledge about ihe location of information, such as the 
Information ManrtoW project (by T. Kirk. A. levy, and D. 
Srrvastava. of AT&T Bel Labs). However, a deeper prob- 
lem remains, thai no solution based solely on building a 
better search- engine can address. This is the fact that 
much valuable information is simply not online, but only 
exists in peopled heads. Furthermore, there are eco- 
nomic, social, and pottical reasons that much valuable 
mtormabon wil never be made pubfidy accessible on the 
Internet or any other network. Indeed, part of the value 
of a piece of information resides in the degree to which 
it is not easily accessible. 

In any large organization* determining who is an 
expert on a particular topic is a crucial problem The need 
for expertise location ranges from informal situations, 
such as where I might need to find an expert on La Tex 
macros to hefc> foe a typesetting problem in a paper I'm 
writing, to formal consfruction of project teams to meet 
business needs. The range of expertise speculations 
may range from the generic fwho knows about togpc pro- 
gramming?") to the highly specific 0*ho knows how to 
modify the interrupt vector handling microcode in the 
reboot module of the X2Y999 processor 7% 

Onfine directories of expertise rarefy exist and when 
they do. the irtomation fiat contain is certain to be out 
of date and irxxxnplete. In fact expertise needs are 
potentiafiysospecfctTatiiss 
nine a comprehensive set of categories in a dv ance. 
Expertise location is therefore generally an "ask a per- 
son" task, with the ai the problems associated with that 
approach outfined above. 

Let us consider for a moment how expertise location 
actually works when * is successful. In a typical case I 
contact a smal set of coieagues whom I tfw* might be 
famOiar with the topic Because each person knows me 
personaBy. they are quite ikefy to respond. Usually none 
of them is exactly the person t want; however, fiey can 
refer me to someone tfiey Know who might be After fol 
towing a chain of referrals a tew layers deep I frafiy fnd 
the person I want 

Note that to this successful scenario I needed towalk 
a fine line between contacting too few people (and thus 
not finding the true expert) and contacting too many (and 
eventually making a pest of myself)- Even in the end I 
might wonder if I might not have found even a better 
expert if enfy I could have cast the net a bit wider. I may 
have had difficulty bringing to mind those people I do 
know personaly who have some expertise in the desrred 
area. If only ai of my coBeagues employed enrJessfy 
patient assistants that I could have contacted inrtiafy, 
who woukf have known something about tteir bosses* 
areas of expertise, and who could have answered my ini- 
tial queries wtthout ocsturbing everyone ... 

Now let us consider how mad raters could be used 
to augment the expert location process. Each person's 
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mail filer would create a model of thai person's areas of 
interest This model would be created automatically by 
using information retrieval (IR) techniques (such as 
inverted indexes) on afl the documents created and 
received by the user. The user model could be quite large 
and detailed, and would be private to the user, that is* 
not stored in a central database. The mail titer would 
also create a much more coarse-grained model of my 
contacts by applying simitar techniques to ail the elec- 
tronic mail that I exchange with each person. 

When I have an expertise location need. I present 
the problem to my mail liter as an unstructured text 
description. Again using IR techniques, my mai fitter 
selects a meolum-to-Jarge set of my contacts to whom 
the query may be relevant It then broadcasts the query, 
not to the people themselves, but to their mail filters. 
Upon receipt of the question, each mail filter checks if its 
owner's user model does indeed provide a good match. 
If there is a good match, the mai liter presents my 
request to its owner. H the owner's model does not match, 
but the model of one of the owner's contacts does, then 
the marl fitter can ask the owner Hit can provide a referral. 
Finally, if there is no match at aft. the query is silently 
logged and deleted. A great deal of flexMrty can be built 
into each maH titer, depending upon its owner's prefer- 
ences. For example. I might allow automatic referrals to 
be given to requests that come from my closest col- 
leagues. * 

This system provides several benefits over either 
sendmg personal e-mai to everyone in order to find an 
expert or using netnews to find the expert. First it is 
largely passive on the part of the recipients - they do not 
need to be rearing netnews and wad ng through dozens 
of articles. Second, queries are broadcast m a focused 
manner to those who are at least somewhat ikety to find 
them of interest Third, users are shielded from seeing a 
large number of completely irrelevant messages; each 
mail filter 109 may process dozens of messages lor every 
one the user sees. RnaBy. messages frat a user does 
see do not come from "out of the blue", but rather are 
tagged with a chain of referrals from colleague to col* 
league 

One reason to beBevethat the syst em just descrbed 
would be useful in practice is that it basically models Ihe 
manner m which expertise location actually works now 
(O. Krackhardt and J.R. Hanson, Informal Networks: 
The Company Behind the Chart*. Harvard Business 
Review. July August 1993). while allowing more people 
to be contacted without causing disruption and distur- 
bance. 

Implementation of an Expertise Locator 

A presentty-prefen ed entxxjrnent of the expertise loca- 
tor has been implemented using the network agents 
descrfced in Coen, et aL. Network Agents. US. Patent 
application 08/203.147. fled 2/28/94. In the implemerv 
tation. mart titer 109 is a component of a user agent 
which handles e-mai messages tor its user. Mail titers 



109 are written in the programming language Visual 
Baskx and run on a standard personal computer. Inter- 
active user mail interface 117 preset^ 9>e expertise 
locator in mail filter 109 to the user a&€h anthropomor- 

5 phic TaJfong heacT that appears in a window on the com- 
puter screen. AB the computers running mail liters 109 
are networked (cunentfy using the protocol TCP/IP), and 
can exchange electronic mail with each other and with 
any person. A mai titer 109 can also invoke other pro- 

io grams to perform various subtasks. 

Each mai titer 109 has access to two sets of data 
base hies. Ihe first set shown in FK1 2. implements cor* 
respondent models 111. the second set. shown m FIG. 
3. implements user model 113. Each of Jhe data base 

is ties in the two sets is specific to and owned by the indi- 
vidual user of mai titer 109. It is important to note that 
we do not assume that these files can be directly 
accessed by anyone other than the user and mail fitter 
109. 

20 Conespondent modets 111 contains the following 
five ties: • 

• Colleague fist 201 which contains entries 203 for 
some of the user s ccJeagues. Each entry 203 con- 

6 tains an tieritr>cation2tt 

a fcst of keywords 207 describing the colleague's 
areas of expertise. 

An Emai tie 209 which contains a* of the emai 
30 211(0 n) that the user has sent and received for a 
substantial period of time: typicaBy. the past year or 
several years. 

• An Emai inverted index fie 213 lhat has an entry 
3$ 215 for each word that appears in any emai mes- 
sage Entry 215 contains a word 217 and a Bst of the 
rumbers ol the messages in emai fie 209 toat con- 
tain that word. This kind of fie can be generated 
using standard information retrieval algorithms. 

40 such as ftose described in (G. Salton. Automatic 
Text Processing. Adtfsorv Wesley 1989). 

• AserxferAecipier* list fie 221 lhat has an entry 223 
for each message in emai fie 209. The entry con- 

45 tains fi^e identifier of the sender of fre corresponding 
message (if other than the user) or the identifier of 
the recipient of the corresponding message (if sent 
by the user). 

so RG. 3 shows the data base ties used to "wnplementuser 
model 113. 

• User expertise fist 301 is a fie containing a list of 
keywords that descnbe some of the user's own 

55 areas of expertise 

• User ties inverted index 305 is a fie containing an 
inverted index of text ties in the user's cfirectory That 
is, tor every word lhat appears in any fie the user 
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has stored on the compute*, this fie contains a list 
of the names of the ties containing that word 

In tie preferred entxxfrment. coleague list 201 and user 
expertise 6st 301 are created by mai titer 109 n inter- s 
action with user t05(n); the inverted index ties 213 and 
user files inverted index 305 are created automatically 
by mai filer 109. This kind of very large inverted index 
can be quctty created and searched by the prog/am 
•tfimpse- {U. Manber and S. VAi, X5UMPSE: A Tool to io 
Search Through Entire F3e Systems." Usenix Winter 
1994 Technical Conference. San Francisco (January 
1 994). pp. 23-32). In making inverted 1st 305. GLMPSE 
uses a UNIX operating system (UNIX is a trademark of 
XOPEN) utifity which determines whether a tie is a text is 
fie. In addition, the user can specify to GLIMPSE which 
directories ol files or individual Res are to be indexed. 

A user begins the process ol locating an expert in a 
topk:byclKidngonthewinc^kxrnaifmer 109 and typ- 
ing a phrase that descnbes the general kind of request 20 
(s uch as . 1 need to locate an expert-). Mail fitter 109 then 
prompts the user for a phrase describing the area of 
expertise. Once this is done, mai titer 109 generates 
and presents for approval a list of suggested canrJdates 
for receiving me request 2s 

The fist of candidates is generated by contorting 
names from two sources, Frst, names are added that 
appear in colleague fist 201. such that the words that 
appear in the phrase describing the expertise request 
appear in the fist of keywords 207 associated with name oo 
205. 

^Second, names are added tat result from tie fol- 
lowing computation. First for each word that appears in 
the expertise request, mail titer 109 retrieves from emai 
inverted index He 213 a fist of messages 4O3<0..n) (FIG. * 
4) containing tiat word. Next to intersection of deists 
is computed, generating a Est of messages 405 each of 
"Nch appears m every one ol the previous fists Next, 
list of messages. 405 is compared against senderTrecp- 
ientist He 221. and the total number of messages that 40 
appear in 1st of messages 405 that are from each each 
person in senderAedpent Est 221 is calculated. The 
resui is a narneAnessage number par Est 407 of pairs 
of "(person name* and 'number of messages*. Finally, Est 
407 is sorted according to *r^ <5 
20 names with the highest number of messages in this 
list are ton added to me fist of candidates. 

After the 1st of candidates has been approved by the 
user, mail titer 109 makes a reqpient specTier 121 and 
adds it to the email message Recipient specifier 121 so 
contains a redpient type request 121 wttch specifies that 
an expert is being requested and expertise descrption 
401 is used as recipient descnp6on 12S 

The message travels through to network and 
arrives at the computer systems^) of the recpients 55 
Eachrecpient maitilter lOSrxterec^iwtspecifief 121 
specrfying that an expert is t*^ requester* removes 
the e-mail message from the incorring maa stream, and 
processes i as follows: 



first to words in expertise description 401 con- 
tained in the messages recipient specifier 121 ve 
matched against the recipient's user expertise fist 301. 
n to words appear in fist 301. then ma££ter 109 
assumes that this request is appropriate for the reaper* 
to see 

n the words in the phrase do not match against tie 
contents ol user expertise 1st 301. mai liter 109 uses 
user ties inverted index fie 305 to match the phrase 
against the contents of al of the recptenfs ties whkh 
are indexed in fie 305. This matcNng can be effoenfy 
performed using the program X5UMPSE* mertaied 
above I the rx^nberof matches is greater then a thresh- 
old number (e g, more than 10 matches), the reoptenfs 
mai titer 109 determines that t*s request is Defy to be 
appropriate for me reqpient 

If to recpienTs mai filter thus determines in either 
way that the message is appropriate, rt uses user mai 
interface 117 to make the the message appear on tie 
reoptenfs computer screen. The recipient is then gmn 
to option of (i) responding affirmatively back to fie 
sender. («) responding nega&«*y back to the send**, cr 
(ii) referring the request to someone else V this fhdl 
option is selected, the reoptenfs mai titer 1 09 creates 

afetolcanrtdalerec^xer^asdescrx^arxveand^ 
process is repeated. 

As is apparent from the foregoing description, to 
preferred embocfiment of the expertise locator fecreases 
its efficiency by using two-stage correspondent mod* 
111 and user models 113. The first stage is t>e expfidt 
desenpoons of expertise contained in coleague 1st 201 
and user expertise fist 301; tie second stage is tie 
inverted indexes: inverted index 213 into emai fie 209 
and inverted index 305 into the to user's text fies. The 
aJgorHhms first use the expertise fists 201 and 301. wd 
then toy may in addtton use to inverted indexes 

Example B: Enhanced YeOow Page Service 

The general techniques descrtod;tovecanbe<9pfied 
to many deferent kinds of taste. The general approach 
is useful when to foflowing concfifons hokJ: 

1. >bu wish to contact a targe number of people; 
without necessarily broadcasting messages to eve- 
ryone in the world, to to expert tocatx* example, 
to user agent helped determine a prefirrraryfistof 
candidates based on a matching schema Other 
ways of determining whom to send to mess^e to 
are also useful. In to example below, to recipients 
are simply taken to be a fixed fist of the sender's 
friends and coBeagues. 

2. \bu want the message you send to only be seen 
bypeopletowhcniisrtveryB^to 

to avoid being dferuptrva To that end. you want to 
message you send to expfidty rxJicate to condi- 
tions under which which i shouto be latent 0 be rrf- 
cvant Note that the computation of relevancy may 
rery on irtforrnation that is pr^ate to to recpient h 
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the pr evtous example. Ihe sender indicated the gen- 
eral conditions of relevancy by recipient type field 
123 (thus indicating the general kind of processing 
to be performed by the recipient's mai liter 109) and 
the words in recipient description held 125 describ- 5 
ing the Kind of expertise required (thus providing the 
parameters to that processing). Another way of say- 
ing this is that the sender pro-actrvefy determines the 
general manner in which the message is to be em 
filtered. Note that this «s different from eartier work 10 
on mail litering. which always assumes that the 
recipient of a message is completely responsible for 
establishing the conditions lor filtering (rf any), and 
the sender is completely "passive" with regard to fi- 
tering. /5 

We illustrate these core points with the following 
"Enhanced Yellow Page" service. The basic idea is to 
provide a service that assists people in obtaining one or 
more personal recommendations about a professional 20 
service or business. The system would work as foBows. 

A customer contacts the Enhanced Yellow Page 
Service (EYPS) asking for a number of a particular serv- 
ice (e.g.. a flower delivery service; an autobody shop, a 
roofer, etc.). The contact with the EYPS could be made 2s 
by many possible means of oorrrrrrnicatioa including 
telephone, an on-line service, an internet Mosaic/HTTP 
server, or election*: mail; alternatively., the EYPS soft- 
ware and directory could even be rJstitouted to users and 
run entirely on their personal computers. 30 

The EYPS gives one or more poss^le numbers. The 
customer can then ask the EYPS to help in obtaining one 
or more personal recommendations about the service or 
business. 

To obtain the r econvnendations. the EYPS first con- 35 
siders people bom a fist of friends or colleagues of the 
customer. (One way to obtain this list is by simply asking 
the customer to register friends, family, or cofleagues but 
there are also less intrusive ways of doing this, such as 
by keeping track of people with whom the customer fre- «o 
quentfy communicates.) 

Now. the key idea is that the EYPS does not simply 
contact every person on the fcst but rather only contacts 
those people that have dealt with fre particular service 
or business number in the last couple of months. There «5 
are at least two ways in which this kind of "sender pro- 
active liter ing" can be done: 

1. The EYPS contacts mai fiter 109 for each friend 

or colleague, indicating the name and telephone so 
number for the service for which a recommendation 
is deseed. Marl filters 109 that have been trusted 
with their owner's telephone records and/or records 
of business transactions can determine whether 
their owner has dealt with that company. If so. they 55 
pass the request on to the owner. 

2. ft the EYPS has drect access to the telephone 
records of the friends and colleagues (which is the 
case if the EYPS is irrplernented by a program rurv 



r*ng in a fong<kstance network itself). fren it checks 
fre phone records itself to determine fre fcst In ends 
and coieagues that have called tha^ompany. 

Thus, instead bothering & large group of people, 
there is a careful screening to ensure that onfy those are 
contacted who have had some recent dealings with the 
particutar service or business. There are various ways 
of how tie EYPS can complete the process. The least 
intrusive way would be to simply leave a message with 
some of fre selected people saying "Mr. or Ms, X would 
be interested n any opinion or recommendation about 
service Y. Please contact X at or leave message at 
number Z. This request expires at midnight" 

Note frat this kind of "pro-active" mai filtering can 
also be mpiemenled by having the user send a message 
drecfly to someone's mad titer 109. The message 
header would include a dk ectrve saying 'pass on to user 
4 he or she has contacted service X at least twice in fre 
last frree months.' Upon receipt of the message, mail 
liter 109 wtf now filter the* message based on the 
rdrled directive. Again, note the difference with the 
current forms of mail-filtering, where filtering is under 
complete control of the recipient and fre sender does 
not grve rjrect instructions to the fitering program. 

Such a system naturally raises many privacy issues 
frat can be addressed. For examples you may not nec- 
essariy let the person seeking the recommendation 
know who gets the request for advice message. That 
way. people would not feel obliged to respond. Atea fre 
identify of the requester could be protected by simply 
having a message saying "A friend would fike an opinion 
or recommendation about service Y." In that case fre 
EYPS would onfy reveal the identity of fre requester 
once the recipient agrees to respond. 

Conclusion 

The foregoing Detailed Description has disclosed to 
rose skSed in the computer and networking arts how 
non-address recipient information in an e-mai message 
and a mad filer which includes a model of the recipient 
may be used to reduce the amount of junk e-mail 
received by fre recipient and how fre norvadoV ess recip- 
ient information and a mai fiter which includes models 
of fre sender's correspondents may be used to reduce 
fre amount of e-mai sent by a user. The Detailed 
Cescrzxon has further disclosed how the above tech- 
niques may be used to construct an expertise locator and 
has dtedosed the best mode presently known to fre 
inventors for implementing the expertise locator. 

twi be rnmedtatefy apparent to those skaed in the 
computer and networking arts that the principles of fre 
rvention may be used in any Situation where a mail filer 
has access to information which enables it to respond to 
non-address information about the potential reopents of 
an e-mai message, ttwil be further apparent thai many 
techniques may be used to construct models of fre cor- 
respondents and recipients for use 
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models may be simple fists of keywords, they may be 
inverted ties, they may be data bases, or they may be 
any other arrangement oTdala which permits t>e mail 
filer lo determine from the model and the non-address 
information whether the potential recipient srioutd acto- 5 
ally receive the message, lit wit further be apparent to 
those skilled in the art that the location of t>e ma* Rter 
in the network is a matter ol design choice F*ers which 
are located on the same computer system as the recip- 
ient have better access to recipient information, whie w 
those which are located closer to the sender are more 
efficient at reducing the total amount of network fcatfc 
An of the above being the case, the foregoing 
Detailed Description is to be understood as being in 
every respect fflustrative and exemplary, but not restric- 1$ 
trve. and the scope ol the invention disclosed herein is 
not to be determined from the Oetaied Description, but 
rather from the claims as interpreted accordvig toh>U 
breadth permitted by the law. 

20 

Claims 

1. Apparatus for automabcaHy Imrting the recipients of 
a message sent via a mai system knplernentedina 
computer system, the apparatus comp risi n g: 2$ 

recipient specifying means in the message 
which uses non-address information to specify the 
recipients of ftetmessage: and message fatering 
means in the computer system having access to 
recipient information contained therein about at 00 
least one of me potential recipients and rcfurJng 
means responsive to the new address irtormation 
and to the reqpient information for provkfing the 
message to the potential recipient 9 thenon-address 
information and «ie recipient information together as 
indicate that the potential recipient is to receive the 
message. 

2. The apparatus set forth in daim 1 further compris- 
ing: ^ 

means for analyzing ties belonging to the 
potential r eopient to obtain the recipient rtormation 

3. The apparatus set forth in daJmlwheren: 

the non-address information species an 45 
area of expertise: and me recipient informaBon rxS- 
cates an area of expertise of the potential recipient 

4 - The apparatus set forth in claim 1 wherein 

*>e message filtering means is associated so 
with a sender of the message. 

5. The apparatus set tonh in dairn 1 where*v 

the message fltering means is associated 
wfththepotenSal recipient of the message 55 

6. The apparatus set forth in darrn 1. claim 2. daim 3. 
claim 4. or claim 5 whereat: 

the recipient information includes further 



recipient Mormafen about other potential reori- 
ents; and 

the message TOering means further uses the 
non address information and the rurfl^Sedpient 
information to determine which of the otfw potential 
recipients is to receive the message. 

7. The apparatus set torfi in claim 1 wherein: 

the message Wtering means further mdudes 
means for senrjng a referral message to a source 
cf the message when tfie message fltehng means 
provides the message to the user. 

a. The apparatos set for* in daim 7 whererc 

the referral message contains an identifica- 
tion of the user. 

9. The apparatus set forth in daim 7 wherein: 

the message is received by a pforafity of 

users; 

the message includes information specifying 
the users who received the message: and 

the referral message further contains the 
information spedrying the users who received the 
message 

ia The apparahis set forti in daim 6 when** 

the message includes information specrTying 
frie users who received the message; and 

the message titration means adds informa- 
tion about the o*» potential recipients to the nfor- * 
matron speciying tie users who received the 
message 
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